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REMARKS 



Claims 1-8 and 10-48 are pending in the present application, claim 9 being cancelled and 
new claims 13-48 added by the above amendment. The new claims are drawn to the same 
category of invention as the elected restriction group, so should be considered in the present 
application. Claims 10 and 1 1 were withdrawn by the Examiner after restriction, leaving claims 
1-8 and 12-48 under consideration. Claims 1, 3, 4, 7, 8 and 12 are amended by the present 
submission. Support for these amendments and new claims can be found throughout the 
application as filed, including at page 2, lines 27-29; page 3, line 26 through page 4, line 19; 
page 5, lines 2-5 and 21-24; page 8, line 23 through page 9, line 1; page 9, lines 9-31; page 10, 
line 3-4; page 13, lines 8-18; page 13, line 29 through page 14, line 3; page 16, line 29 through 
page 19, line 2; page 22, line 6 through page 26, line 23; figure 14; the claims as filed; and the 
sequence listing, inter alia. No new matter has been added. 

SEQ ED NO:l is a cDNA encoding what applicants have designated "GTAR14-1 
protein." The coding sequence corresponds to nucleotides 9-947 of SEQ ID NO:l (see, e.g., 
page 16, lines 12-16, Figure 14, and the Sequence Listing). SEQ ID NO:4 is the amino acid 
sequence of the GTAR14-1 protein. 

Objections to the Specification 

The Examiner objected to the title of the application as not descriptive (Office Action, 
page 3). The title has been amended to "HUMAN GTAR G-PROTEIN COUPLED 
RECEPTOR" at the Examiner's suggestion. 

Further, the Examiner objected to the specification as not fully in compliance with the 
sequence rules (Office Action, p. 3). Applicants respectfully direct the Examiner's attention to 
the amendments submitted on August 8, 2002 in response to the Notification of Missing 
Requirements, in which the specification was been amended to insert the sequence identifiers 
into the figure legends for figures 1-6, 10-19, and 24-31. Absent notification to the contrary, 
Applicants assume that the amendments contained therein were entered. The specification is 
further amended by the present submission to insert sequence identifiers for the primers shown in 
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Figures 4-6 and 16-19 into the appropriate figure legends. When a sequence is presented in a 
drawing, MPEP 2422.02 allows for the sequence identifier to be included in the Brief 
Description of the Drawings. Therefore, Applicants submit that the specification as amended 
does comport with the sequence rules, 37 CFR 1.821-1.825. 

Applicants submit that the specification is now compliant and request withdrawal of the 
objections thereto. 



Claim Objections 

Claims 1 and 12 have been amended to remove all reference to non-elected subject 
matter, as requested by the Examiner. Applicants request withdrawal of the objection to these 
claims. 

35 USC 101 and 1 12, first paragraph 

Claims 1-6 and 12 were rejected under 35 USC § 101 "because the claimed invention 
lacks a credible, specific and substantial utility." Office Action, p. 4. Applicants respectfully 
traverse. 

The utility requirement is a relatively low threshold. From the MPEP, § 2107.1 : 

. . .as the Federal Circuit has stated, "[t]o violate [35 U.S.C.] 101 the claimed 
device must be totally incapable of achieving a useful result." Brooktree Corp. v. 
Advanced Micro Devices, Inc., 977 F.2d 1555, 1571, 24 USPQ2d 1401, 1412 (Fed. Cir. 
1992) (emphasis added). See also E.I, du Pont De Nemours and Co. v. Berkley and Co. , 
620 F.2d 1247, 1260 n.17, 205 USPQ 1, 10 n.17 (8th Cir. 1980) ("A small degree of 
utility is sufficient . . . The claimed invention must only be capable of performing 
some beneficial function ... An invention does not lack utility merely because the 
particular embodiment disclosed in the patent lacks perfection or performs crudely ... A 
commercially successful product is not required . . . Nor is it essential that the invention 
accomplish all its intended functions ... or operate under all conditions . . . partial 
success being sufficient to demonstrate patentable utility ... In short, the defense of 
non-utility cannot be sustained without proof of total incapacity." If an invention is 
only partially successful in achieving a useful result, a rejection of the claimed invention 
as a whole based on a lack of utility is not appropriate. See In re Brana , 5 1 F.3d 1560, 34 
USPQ2d 1436 (Fed. Cir. 1995); In re Gardner , 475 F.2d 1389, 177 USPQ 396 (CCPA), 
reh'g denied, 480 F.2d 879 (CCPA 1973); In re Marzocchi, 439 F.2d 220, 169 USPQ 367 
(CCPA 1971). 
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Furthermore, the MPEP exhorts the Examiner to "focus on and be receptive to assertions made 

by the applicant that an invention is 'useful 5 for a particular reason." MPEP § 2107.01. The 

Guidelines for Examination of Applications for Compliance with the Utility Requirement 

(MPEP § 2107, referred to herein as "the Utility Guidelines") further state: 

Office personnel are reminded that they must treat as true a statement of fact made by an 
applicant in relation to an asserted utility, unless countervailing evidence can be provided 
that shows that one of ordinary skill in the art would have a legitimate basis to doubt the 
credibility of such a statement. 

As further explained by the Utility Guidelines, 

If the applicant has asserted that the claimed invention is useful for any particular 
practical purpose (i.e., it has a "specific and substantial utility") and the assertion would 
be considered credible by a person of ordinary skill in the art, do not impose a rejection 
based on lack of utility. A claimed invention must have a specific and substantial utility. 
This requirement excludes "throw-away," "insubstantial," or "nonspecific" utilities, such 
as the use of a complex invention as landfill, as a way of satisfying the utility requirement 
of35U.S.C. 101. 

The Applicants' assertions of utility hardly sink to the level of use as a landfill. 

The present specification is rife with examples of the utility of the claimed invention. For 
example, the claimed GTAR14-1 nucleic acids and proteins exhibit tissue-specific expression 
patterns. mRNA for GTAR14-1 is observed only in the thymus during fetal development, and in 
testis, pancreas, and lymphatic/hematopoietic tissues including thymus and spleen in adults (see 
the specification at page 2, lines 32-35; page 6, lines 6-18; page 39, lines 22-25; and Fig. 7). All 
of these organs and tissues are known to be play a role in regulation of the lymphatic and 
endocrine systems. Thus, at the very least, one of skill in the art would appreciate that the 
claimed GTAR14-1 nucleic acids and polypeptides can be used as tissue markers (detected by, 
for example, immunohistochemical detection of antibodies generated against the claimed 
polypeptides or fragments thereof, or in situ hybridization using labeled fragments of the claimed 
polynucleotides) for those specific tissues, and can be used for developmental studies including 
cell fate experiments, for example, to evaluate the effects of a drug on the development of the 
immune system. This provides a well-established utility. To illustrate this point, Applicants 
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note that a similar approach can be seen in Forster et al., Blood 84(3):830-840 (1994) (abstract 
attached hereto as Exhibit A ) which describes the use of BLR1 as a marker for memory T cells. 
The marker is detected using a BLR1 -specific monoclonal antibody. Other examples include 
Cornish et al., B. J. Exp. Pathol. 70(5):495-504 (1989)) (abstract attached hereto as Exhibit B V 
which describes the use of antibodies to various B- and T-cell subsets to detect changes in the 
levels of those subsets associated with the induction of Heymann nephritis. Rocha et al, Blood 
89(6):2189-202 (1997) (abstract attached hereto as Exhibit C \ describe the use of monoclonal 
antibodies to distinguish donor from host cells, to differentiate between subsets of T 
lymphocytes, and stain certain macrophages, after systemic transfer of allogeneic antitumor 
immune T lymphocytes. Clearly, one of skill in the art would consider the similar information 
provided in the present specification a credible assertion of utility. 

The tissue-specific expression pattern exhibited by GTAR14-1 provides a reasonable 
basis for a role of GTAR14-1 in immune function, including lymphoid development, e.g., the 
differentiation, proliferation or activation of lymphatic cells (see, e.g., page 6, lines 6-27). One 
of skill in the art would consider it reasonable and credible to suggest that GTAR14-1 is useful in 
screening for new targets and therapeutics for the modulation of immunity. The use of 
GTAR14-1 nucleic acids and polypeptides to screen for agents that modulate 
lymphatic/hematopoietic function or development is described in detail in the specification, see, 
e.g., pages 16-20. The specification at page 3, lines 21-25, describes the use of the claimed 
proteins in methods of screening for the natural ligand of GTAR14-1, i.e., a novel hematopoietic 
factor, or a novel hormone-like peptide factor. Both of these disclosures provide a substantial 
and specific utility. 

The Examiner dismisses the use of the polypeptides in the generation of GTAR14-1 
antibodies as "not specific or substantial. . . [I]f the specification discloses nothing specific and 
substantial about the polypeptide, both the polypeptide and its antibodies have no patentable 
utility." Office Action, page 6. As noted above, the GTAR14-1 polypeptides are useful at least 
as tissue markers; antibodies generated using the polypeptides as antigens are useful in detecting 
GTAR14-1 in tissues, and thus are useful in tissue identification and developmental analysis. 
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The claimed polypeptides, and antigenic fragments thereof, are useful to generate such 
antibodies, and additionally as positive controls for the antibodies. 

The Examiner further makes the assertion that "[n]o well-established utility exists for 
newly isolated, complex biological molecules." Office Action, p 4. Applicants respectfully 
request that the Examiner explain what statutory or case law language supports such a 
sweepingly conclusory statement. If by "newly isolated" the Examiner means 
"uncharacterized," Applicants respectfully point out that this certainly does not describe the facts 
of the present case. If instead the Examiner means the term to apply to all novel genes and 
proteins, whether or not they are characterized, Applicants respectfully note that such a position 
is entirely inconsistent with U.S. practice. Genes and proteins are without question patentable 
under U.S. law. Clarification is requested so that Applicants can respond appropriately. 

The Examiner is further reminded that 

The PTO has the initial burden of challenging a presumptively correct assertion of 
utility in the disclosure. . . . Only after the PTO provides evidence showing that one of 
ordinary skill in the art would reasonably doubt the asserted utility does the burden shift 
to the applicant to provide rebuttal evidence sufficient to convince such a person of the 
invention's asserted utility. 

In re Brana . 51 F.3d 1560, 34 USPQ2d 1436 (Fed. Cir. 1995) (emphasis added). Applicants 
have provided sufficient characterization of the nucleotide and amino acid structure, significant 
structural homology to a relatively-well characterized group of proteins (the G-protein coupled 
odorant receptors), and expression pattern of these "newly isolated, complex biological 
molecules" to allow one of skill in the art to identify a number of possible uses, explicitly 
disclosed in the specification or not, that are certainly "well-established" enough to satisfy the 
requirements of 35 USC § 101. The Examiner has provided no evidence to contradict the 
Applicants' assertions of utility. 

For the foregoing reasons, Applicants submit that the claimed invention possesses 
sufficient utility, and respectfully request withdrawal of the rejection under 35 USC § 101. 
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Claims 1-9 and 12 were rejected under 35 USC § 1 12, first paragraph, "since the claimed 
invention is not supported by either a specific and substantial asserted utility or a well 
established utility for the reasons set forth above, one skilled in the art clearly would not know 
how to use the claimed invention." Office Action, pages 7-8. Claim 9 has been cancelled. 
Applicants respectfully traverse the rejection as it may apply to the remaining claims. 

For the reasons given above, Applicants submit that the claimed invention does possess 
utility. In addition, the specification provides substantial guidance as to how to use the invention 
as presently claimed. See, e.g., Example 1, which describes the isolation and analysis of 
GTAR14-1. The specification teaches the sequence (i.e., the primary structure) of the 
GTAR14-1 polypeptide, and of a nucleic acid encoding that polypeptide. Methods of making 
recombinant polypeptides, and methods of using the polypeptides for the generation of 
antibodies, are described in detail. The references cited by the Examiner deal with the 
difficulties of predicting function based solely on homology to an annotated sequence in a 
database. The predictions of function included in the present application are not based solely on 
homology to annotated sequences; first, the sequences to which GTAR14-1 has homology are 
proteins with proven function, members of a described family, not simply sequences randomly 
grouped by automated correlation software. GTAR14-1 has significant homology to members of 
the olfactory receptor gene family (see, e.g., Fig, 1), has seven transmembrane regions 
characteristic of G-protein coupled receptors (see Figs. 10 and 11), and has amino acids known 
to be essential for function of G-protein coupled receptors (see page 5, line 10 to page 6, line 1). 
The data regarding the function of these homologs comes not from database annotation, but from 
empirical observations of the function of the olfactory receptors. Furthermore, the predictions of 
function are supported by details of the tissue-restricted expression patterns of GTAR14-1. One 
of skill in the art, given this information and the other information included in the application as 
filed, would be able to use the claimed polypeptides and polynucleotides, e.g., as probes to detect 
tissue-specific markers, and in screens for novel hematopoietic factors, or novel hormone-like 
peptide factors, e.g., modulators of immunity. 
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The Examiner additionally stated that "the specification does not reasonably provide 
enablement for all fragments and variants of SEQ ID NO:4" (Office Action, page 10; italics in 
original). Claim 3, which referred to functional fragments of the protein of claim 1, has been 
amended to refer to antigenic fragments of the polypeptide of claim 14: i.e., SEQ ID NO:4. 
Claim 1 has been amended to claim an isolated G protein-coupled receptor protein selected from 
the group consisting of (a) a polypeptide comprising the amino acid sequence of SEQ ID NO:4; 
and (b) a polypeptide comprising the amino acid sequence of SEQ ED NO:4, in which one to 
thirty amino acids are deleted, added, inserted, or substituted by another amino acid residue. 
This substantially limits the fragments and variants, both structurally and by function (since each 
protein covered by claim 1 must have the function of a G-protein coupled receptor). Applicants 
submit that the requirements of 35 U.S.C. § 101 and § 1 12, second paragraph, are amply 
satisfied. 

35 USC § 1 12, First Paragraph - Written Description 

Claims 1-6 were rejected as "containing subject matter which was not described in the 

specification in such a way as to reasonably convey to one skilled in the relevant art that the 

inventor(s), at the time the application was filed, had possession of the claimed invention." 

Office Action, pages 11-12. In particular, the Examiner stated that: 

The specification as originally filed does not provide adequate written description 
for an isolated protein wherein one to thirty amino acids of the receptor protein are 
modified or deleted, while still maintaining the function of the receptor. 

Office Action, page 12. The Applicants respectfully traverse and direct the Examiner's attention 
to the specification as originally filed at page 8, line 1 1 through page 9, line 1, especially page 8, 
line 25, as well as claim 1 as originally filed. The rejection seems to be based on the fact that the 
applicants did not disclose the results of functional assays on the GTAR14-1 protein. Applicants 
note that the sole function recited in claim 1 is "G-protein coupled receptor protein." This 
general class of receptor proteins has been extensively studied in the art. A great deal is known 
about the structure and function of various structural domains of receptors in this class. Thus, 
one of ordinary skill in the art would know that certain residues of SEQ ID NO:4 (e.g., 
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hydrophobic residues in one of the seven transmembrane domains characteristic of the class) can 
be substituted with similar residues, with little likelihood of losing the "G-protein coupled 
receptor" function of the protein (see, e.g., the pre-October, 1998 studies reviewed in 
Gershengorn and Osman, Endocrinology 142(1):2-10 (2001) (attached hereto as Exhibit D ); also 
see Ji et al., J. Biol. Chem. 273(28):17299-17302 (1998) (attached hereto as Exhibit E) \ 
Likewise, it is known that intracellular domains of different G-protein coupled receptors can be 
swapped, with the resulting hybrid receptors retaining their general G-protein coupled receptor 
characteristics. (See Postina et al., Adv. Exp. Med. Biol. 449:371-85 (1998) (abstract attached 
hereto as Exhibit F T). 

It is generally known that proteins are "highly tolerant" of amino acid changes (see, e.g., 
Bowie et al. (1990) Science 247:1306-1310 (copy enclosed as Exhibit G ). At page 1306, lines 
12-13, Bowie teaches that "proteins are surprisingly tolerant of amino acid substitutions." Bowie 
et al. cites as evidence a study carried out on the lac repressor. Of approximately 1500 single 
amino acid substitutions at 142 positions in this protein, about one-half of the substitutions were 
found to be "phenotypically silent," i.e., had no noticeable effect on the activity of the protein 
(Bowie, at page 1306, col. 2, lines 14-17). Presumably the other half of the substitutions 
exhibited effects ranging from slight to complete abolishment of repressor activity. Thus, one 
can expect, based on Bowie et al. f s teachings, to find over half (and possibly well over half) of 
random substitutions in any given protein to result in proteins with full or nearly full activity. 
These are far better odds than those at issue in In re Wands , 858 F.2d 731 (Fed. Cir. 1988), in 
which the court said that screening many hybridomas to find the few that fell within the claims 
was not undue experimentation. Based on Bowie et al.'s teachings, one would predict that even 
random deletions, additions, insertions, or substitutions of residues in SEQ ID NO:4 will 
predictably result in a majority of the variants having full or partial receptor activity. Given what 
is known about G protein coupled receptors in general, one of ordinary skill in the art could 
readily make intelligent decisions about what positions to alter and which to leave untouched in 
order to preserve the claimed function. The Examiner has presented no evidence to the contrary. 
Applicants submit that the claims satisfy the requirements of 35 USC § 1 12, first paragraph. 
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Claims 8 and 9 were also rejected by the Examiner for lack of written description. 
Claim 9 has been cancelled, obviating the rejection as to that claim. Claim 8 recites a method of 
screening for a compound that binds to a G protein-coupled receptor protein, the method 
comprising (a) exposing a test sample to the protein of claim 1, and (b) selecting a compound 
that binds to the protein. New claims 41-43 further specify the nature of the protein used in the 
method of claim 8 (in claim 41, the protein comprises SEQ ED NO:4, in claim 42, the protein 
consists of SEQ ID NO:4, and in claim 43, the protein is a fusion protein comprising SEQ ID 
NO:4 and another peptide or polypeptide). Ample support is found in the specification for the 
use of the claimed proteins in simple binding assays; see, e.g., page 16, line 29 to page 20, line 4, 
which describes methods that include Western blotting, yeast two-hybrid assays, and affinity 
chromatography. None of these methods, nor claim 8 (or 41-43), requires specific knowledge of 
the function of the protein. 

The Examiner stated: 

The specification as originally filed does not provide adequate written description 
for a method of using a receptor protein with an unknown function to search for specific 
ligands. This limitation is not expressly asserted, nor does it flow naturally from the 
specification as originally presented. 

Office Action, page 13. Applicants respectfully traverse, as the specification does expressly 

disclose methods of assaying for ligands without knowing the specific function of the receptor 

protein. See, e.g., page 16, line 29 to page 20, line 4, describing methods including 

immunoprecipitation, yeast two-hybrid, and Western blotting. None of those methods require 

any foreknowledge of the specific function of the protein. As mentioned on page 17, lines 5-14, 

the bound compound may be the receptor's ligand. 

Finally, the Examiner further stated that 

The specification as originally filed does not provide adequate written description 
of the method now claimed. The specification teaches the receptor polypeptide of SEQ 
ID NO:4. However, functional assays of the receptor were not performed. It is not 
known what class of ligands to use, what tests to perform, nor what tissues to use, for 
example. 
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Office Action, page 13. Applicants point out that the written description requirement has never 
required that the claimed method have been "performed" in order to be patentable. Nor is it 
necessary to disclose the "class of ligands" to use, as that "class" is not limited in the claim. Any 
class of compound can be screened. (Though in that regard, Applicants point out that the 
specification does describe several possible categories of substances to screen: "peptides, 
purified or crude preparation of proteins, non-peptide compounds, synthetic compounds, 
products of microorganism fermentation, cell extracts, animal tissue extracts, marine organism 
extracts, and plant extracts" (see page 19, lines 17-21), As described above, ample description of 
"what tests to perform" can be found in the specification. For example, as noted above, the 
disclosure at page 16, line 29, to page 20, line 4, describes methods that include Western 
blotting, yeast two-hybrid assays, and affinity chromatography. Tissues that can be used are also 
described, as the tissues in which GTAR14-1 is expressed are amply described in the 
specification (see, e.g., page 2, lines 32-35; page 6, lines 6-18; page 39, lines 22-25; and Fig. 7), 
as are any number of heterologous expression systems (see, e.g., page 10, line 22 to page 14, 
line 34). For the foregoing reasons, Applicants submit that claims 8 and 41-43 satisfy the written 
description requirement of 35 USC § 1 12, first paragraph. 

35 USC § 1 12, Second Paragraph - Indefiniteness 

Finally, the Examiner rejected claim 12 as being indefinite for the use of the term "under 
highly stringent conditions." The Examiner indicated that this rejection could be overcome by 
supplying specific conditions, supported by the specification, which the Applicants consider 
"stringent." Office Action, page 14. Claim 12 has been amended to recite "hybridizes at 65°C in 
2x SSC and 0.1% SDS." Applicants submit that this is sufficient to overcome the rejection under 
35 USC § 112, second paragraph. 

For the foregoing reasons, Applicants submit that the claims as amended satisfy the 
requirements of 35 USC §§ 101 and 112, and respectfully request withdrawal of the pending 
rejections and allowance of the claims as amended. If the Examiner feels that a telephone 
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conference would be useful to expedite prosecution, she is hereby invited to telephone the 
undersigned at 617-956-5985. 

Enclosed is a check for the Petition for Extension of Time fee and a check for the Excess 
Claim fee. Please apply any other charges or credits to deposit account 06-1050, referencing 
attorney docket no. 14875-075001. 

Respectfully submitted, 

Date:^^^? 



Fish & Richardson P.C. 
225 Franklin Street 
Boston, MA 02110-2804 
Telephone: (617) 542-5070 
Facsimile: (617) 542-8906 
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Related Articles, Links 



Expression of the G-protein— coupled receptor BLR1 defines mature, 
recirculating B cells and a subset of T-helper memory cells. 

Forster R, Emricta T, Kremmer E, Lipp M. 

Max-Delbruck-Center for Molecular Medicine, Berlin-Buch, Germany. 

The G-protein-coupled receptor BLR1 related to receptors for chemokines and 
neuropeptides has been identified as the first lymphocyte-specific member of the gene 
family characterized by seven transmembrane-spanning regions. Using a high-affinity anti- 
BLR1 monoclonal antibody (MoAb) and three-color flow cytometry it is shown that BLR1 
expression on peripheral blood cells is limited to B cells and to a subset of CD4+ (14%) and 
CD8+ (2%) lymphocytes. T cells expressing BLR1 were positive for CD45R0, were 
negative for interleukin-2 receptors, show high levels of CD44, and show low levels of L- 
selectin. The majority of CD4+ cells originating from secondary lymphatic tissue, but none 
of cord blood-derived T cells, express BLR1. These observations suggest that BLR1 is a 
marker for memory T cells. Furthermore, BLR1 expression was detected on all CD 19+ 
peripheral or tonsillar B lymphocytes, but only on a fraction of cord blood cells and bone 
marrow cells expressing CD 19, slgM, or slgD. Interestingly, activation of both mature B and 
T cells by CD40 MoAb and CD3 MoAb, respectively, led to complete downregulation of 
BLR1. These data suggest that the G-protein-coupled receptor BLR1 is involved in 
functional control of mature recirculating B cells and T-helper memory cells participating in 
cell migration and cell activation. 
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Immunoregulation in Heymann nephritis. I. Cell marker studies. 
Cornish J, Barabas AZ, Lannigan R, Rozing J. 

Department of Pathology, University of Calgary, Alberta, Canada. 

Immunoregulation was examined in rats with Heymann nephritis (HN), an established model 
of membranous glomerulonephropathy (MGN). There is little known of the cellular immune 
events for the induction and maintenance of the autoimmune response in HN. The cell 
marker studies utilized fluorescein (FITC)-labelled monoclonal antibodies directed to B cells 
(Mark-I), and T cell subsets: pan T (ER-I), helper/inducer T (ER-2) and suppressor/cytotoxic 
T (ER-3). Lymphoid subsets were compared in spleen, lymph nodes, peripheral blood and 
bone marrow, of normal and diseased rats. Animals were investigated during the induction 
and chronic phases of disease. The induction of HN was associated with an early, significant, 
but transient increase of the non-specific myeloid component of the defence system. 
Subsequently, a significant increase was seen in the number of cells of the B lymphocyte 
lineage in HN animals, which coincided well with the overall increased humoral immune 
responsiveness. No alterations in the T lymphocyte subsets were noted during the 
development of this experimental autoimmune disease. 
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Differences between graft-versus-leukemia and graft- versus-host reactivity. 
I. Interaction of donor immune T cells with tumor and/or host cells. 

Rocha M, Umansky V, Lee KH, Hacker HJ, Benner A, Schirrmacher V. 

Deutsches Krebsforschungszentrum Heidelberg, Abteilung Zellulare Immunologic, FSP 
Tumorimmunologie, Germany. 

Graft-versus-leukemia (GVL) and Graft-versus-host (GVH) reactions were compared after 
systemic transfer of allogeneic antitumor immune T lymphocytes from B10.D2 (H-2d; Mis 
(b)) into DBA/2 (H-2d; Mis(a)) mice. Before immune cell transfer, recipient DBA/2 mice 
were sublethally irradiated with 5 Gy to prevent host-versus-graft reactivity. Recipients were 
either bearing syngeneic metastatic ESb lymphomas (GVL system) or were normal, non- 
tumor-bearing mice (GVH system). We previously reported that this adoptive 
immunotherapy protocol (ADI) had pronounced GVL activity and led to immune rejection 
of even advanced metastasized cancer. In this study, monoclonal antibodies were used for 
immunohistochemical analysis of native frozen tissue sections from either spleen or liver to 
distinguish donor from host cells, to differentiate between CD4 and CD8 T lymphocytes, and 
to stain sialoadhesin-positive macrophages at different time points after cell transfer. The 
kinetics of donor cell infiltration in spleen and liver differed in that the lymphoid organ was 
infiltrated earlier (days 1 to 5 after transfer) than the nonlymphoid organ (days 5 to 20). 
After reaching a peak, donor cell infiltration decreased gradually and was not detectable in 
the spleen after day 20 and in the liver after day 30. The organ-infiltrating donor immune 
cells were mostly T lymphocytes and stained positive for CD4 or CD8 T-cell markers. A 
remarkable GVL-associated observation was made with regard to a subset of macrophages 
bearing the adhesion molecule sialoadhesin (SER+ macrophages). In the livers of tumor- 
bearing mice, their numbers increased between days 1 and 12 after ADI by a factor greater 
than 30. Double-staining for donor cell marker and SER showed that the sialoadhesin- 
expressing macrophages were of host origin. The SER+ host macrophages from GVL livers 
were isolated by enzyme perfusion and resetting 12 days after ADI, when they reached peak 
values of about 60 cells per liver lobule, and were tested, without further antigen addition, 
for their capacity to stimulate an antitumor CD8 T-cell response. The results of this 
immunologic analysis suggest that these cells in the liver function as scavengers of the 
destroyed metastases and as antigen-processing and -presenting cells for antitumor immune 
T cells. 
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ABSTRACT 

G pirotein-coupled receptors (GPCRs) represent the largest family 
of signal-transducing molecules known. They convey signals for light 
and many extracellular regulatory molecules. GPCRs have been 
found to be dysfunctional/dysregulated in a growing number of hu- 
man diseases and have been estimated to be the targets of more than 
30% of the drugs used in clinical medicine today. Thus, understanding 
how GPCRs function at the molecular level is an important goal of 
biological research. In order to understand function at this level, it is 
necessary to delineate the 3D structure of these receptors. Recently, 
the 3D structure of rhodopsin has been resolved, but in the absence 



of experimentally determined 3D structures of other GPCRs, a pow- 
erful approach is to construct a theoretical model for the receptor and 
refine it based on experimental results. Computer-generated models 
for many GPCRs have been constructed. In this article, we will review 
these studies. We will place the greatest emphasis on an iterative, 
bi-directional approach in which models are used to generate hypoth- 
eses that are tested by experimentation and the experimental find- 
ings are, in turn, used to refine the model. The success of this approach 
is due to the synergistic interaction between theory and experiment. 
{Endocrinology 142: 2-10, 2001) 



G PROTEIN-COUPLED receptors (GPCRs) represent the 
largest family of signal-transducing molecules 
known. For example, GPCRs comprise more than 4% of the 
genes in Caenorhabditis elegans. GPCRs convey signals for 
light and many extracellular regulatory molecules, such as, 
hormones, growth factors, and neurotransmitters, that reg- 
ulate every cell in the body. Dysregulation of GPCRs has 
been found in a growing number of human diseases, and 
GPCRs have been estimated to be the targets of more than 
30% of the drugs used in clinical medicine today. Thus, un- 
derstanding how GPCRs function at the molecular level is an 
important goal of biological research. 

GPCRs maybe grouped into three subfamilies that exhibit 
little sequence homology but appear to share the same over- 
all topology. In this review, we will focus on receptors in the 
largest subfamily, which comprises more than 90% of 
GPCRs, that includes rhodopsin (Rh) and receptors for small 
neurotransmitters, such as opioids and catecholamines, pep- 
tides such as bradykinin, GnRH, and TRH, and glycoprotein 
hormones. The two-dimensional topology of a typical mem- 
ber of this subfamily, a receptor for TRH, is illustrated in Fig. 
I 1 . GPCRs contain an extracellular amino terminus, three 
extracellular loops (ECLs), seven trartsmembrane-spanning 
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1 In this review, we elected to use the TRH receptor as the only GPCR 
for which the primary sequence is shown. Therefore, only for TRH 
receptor will the positions of specific residues be identified. For all other 
GPCRs, specific residues will be described as being part of a conserved 
sequence/ motif or as part of a particular receptor domain. 



helices (TMHs), three intracellular loops (ICLs) and an in- 
tracellular carboxyl tenninus. The specific domains within 
which different regulatory molecules bind to GPCRs, and the 
changes in a GPCR that constitute conversion of the receptor 
from an inactive to an active state are under intense inves- 
tigation. In general, evidence has been presented that small 
ligands bind primarily in the core of the TMHs, intermediate 
size peptides to the ECLs and TMHs, and larger peptides and 
proteins to the amino terminus, ECLs and, most likely, the 
TMHs of their specific receptors. Upon binding an agonist, 
it is thought that changes are induced in the conformation of 
the receptor, in particular, changes involving the relative 
positions of the seven TMHs. The movements of the TMHs 
are thought to cause changes in the ICLs leading to increased 
coupling to a heterotrimeric G protein(s) and G protein ac- 
tivation. These hypotheses, however, have only begun to be 
tested by direct experimentation. 

Recently (1), the three-dimensional (3D) structure of rho- 
dopsin has been resolved. In the absence of experimentally 
determined 3D structures for other GPCRs, one approach is 
to construct a model for the GPCR and to refine the model 
based on experimental results. Computer-generated models 
for many GPCRs have been constructed. A number of these 
models are available at http://swift.embl-heidelberg.de/ 
7tm/models/lin/index.html. The reliability of the predicted 
model is difficult to assess independently. However, its use- 
fulness is in providing a molecular basis for specific struc- 
tural hypotheses with concomitant functional correlates. 
Moreover, even when 3D structures are resolved experimen- 
tally, molecular models will be useful to generate hypotheses 
regarding the molecular details of GPCR function. Models of 
the TMH domains are constructed on the basis of a topo- 
logical homology, expressed as a general backbone template 
(see below). However, the specific sequence of the receptor 
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Fig. 1. A two-dimensional topology of the TRH receptor sequence. Two-dimensional topology of the TRH receptor with the extracellular space 
at the top and the intracellular space at the bottom. The residues highlighted in circles are sites of high conservation that were used to determine 
the helical boundaries and align the helices on the transmembrane template. The residues highlighted in squares constitute the transmembrane 
binding pocket and those highlighted in diamonds the residues that participate in anchoring TRH to the receptor surface and guiding it to the 
transmembrane pocket. 



determines local structure and interactions between proxi- 
mal residues. Therefore, the initial structure of a predicted 
model must be refined computationally by minimization of 
the initial structure to relieve the offending close contacts. Sub- 
sequently, dynamic simulations that efficiently sample config- 
urational space are used to refine the structure. The construction 
of loops in receptors encounters the difficulty of no available 
templates. Here, a simulated annealing approach is necessary 
to ensure proper folding of the loops and constant temperature 
simulations can be used to refine the initial fold. 

Some investigators used bacteriorhodopsin (Br), for which 
a crystal structure at 1.55 A resolution is now available (2), 
as a template for constructing the TMHs of GPCRs (3). Br, 
however, is not a GPCR even though it is a seven trans- 
membrane-spanning protein. Other investigators used a 
GPCR template constructed by Baldwin (4). This TMH tem- 
plate was based on a comprehensive analysis of hundreds of 
GPCR sequences and was shown to be consistent with many 
mutational analyses. This template agrees well with the re- 
cently reported crystal structure of Rh that was resolved to 
2.8 A (1). It is now clear that position and tilt of TMHs in Rh, 
and by inference in all GPCRs, are different from those in Br. 
Nevertheless, models based on both Rh and Br have proven 
useful in generating hypotheses that have been tested 
experimentally. 

In the majority of reports that describe computer simula- 
tions of GPCRs, the purpose of model construction was to 



describe the binding pocket for the ligand within the recep- 
tor. Models of ligand-receptor complexes were constructed 
incorporating experimental evidence of the specific receptor 
amino add residues with which the ligands were found to 
interact. These models are being used, for example, in efforts 
to discover GPCR agonist or antagonist drugs. Another 
important use of GPCR models is to generate hypotheses 
regarding the binding and signaling functions of these re- 
ceptors for experimental testing. That is, to direct experi- 
mentation to define the structure-function relationships of 
GPCRs. We think the most effective use of models, however, 
is as part of an iterative, bidirectional approach in which 
models are used to generate hypotheses that are tested by 
experimentation and the experimental findings are, in turn, 
used to revise and refine the model. The success of this 
approach is due to the synergistic interaction between theory 
and experiment. We will emphasize this last approach in this 
review. 

Initial Models of Ligand-GPCR Complexes 

Small ligands 

The prototypical complex for a small ligand and its re- 
ceptor is the ll<is-retinal/Rh complex. In contrast to all 
other ligand/receptor complexes to be described, 11-cis- 
retinal is covalently attached to Rh. The linkage is via a 
protonated Schiff base to a Lys in TMH7. As noted above, 
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11-ris-retinal/Rh is the only ligand/GPCR complex for 
which a structure has been resolved experimentally (1). The 
Lys in TMH7 that is attached to retinal lies deep within the 
bundle of TMHs and the entire ligand is within the TMH 
bundle with the six-membered /3-ionone ring closer to the 
extracellular aspect of Rh. The Schiff base is surrounded by 
residues from TMH1, 2, and 7 and the j3-ionone ring interacts 
with residues from TMH3 and 6. Models of ligand/receptor 
complexes for a number of small ligands and their cognate 
GPCRs have been constructed (Table 1). The TMH bundle of 
the receptor was based on either the Rh or Br template. In 
every case, the ligand was docked within the receptor based 
on experimental results that identified receptor amino acid 
residues which directly interact with the ligand. That is, in no 
case was a stable model of a ligand/receptor complex con- 
structed using a computer simulation independent of exper- 
imental evidence delineating the binding pocket. (This is true 
for ligand/GPCR complexes of peptide and protein ligands 
also.) A typical model of a GPCR for a small ligand, TRH 
(pyroGlu-His-ProNH 2 ), is illustrated in Fig. 2 and a close-up 
of the TRH/receptor complex is shown in Fig. 3. The details 
of our construction have been described elsewhere (5). TRH 
is predicted to interact with residues in TMH3, 6 and 7 within 
the extracellular part of the TMH bundle (see below). 

Peptide ligands 

As noted above, peptide ligands bind to GPCRs by inter- 
acting with the extracellular domains and TMHs. It is note- 

TABLE 1. Models of GPCRs and ligand/GPCR complexes 0 



Receptors for small ligands 


References 


Adenosine 


(51,52) 


Adrenergic 


(40,46,53) 


Cannabinoid 


(54) 


Dopamine 


(55) 


Formyl peptide 


(56) 


Histamine 


(22,23) 


Muscarinic 


(57) 


Opioid 


(24,58) 


Platelet-activating factor 


(59) 


Purinergic 


(28,60) 


Serotonin 


(25,49) 


TRH 


(18-21,26,27,37,47,50) 


Receptors for peptides 




Angiotensin 


(61,62) 


Bradykinin 


(12,63) 


Cholecystokinin 


(6,7,30) 


Endothelin 


(64) 


GnRH 


(10,11,35,36) 


Gn secretagog 


(31) 


Melanocortin 


(65) 


Melanin-concentrating hormone 


(32) 


Neuropeptide Y 


(66) 


Neurotensin 


(67) 


Oxytocin 


(43) 


Tachykinin 


(68) 


Vasopressin 


(8,9,33,34) 


Receptors for large polypeptides/proteins 




Chemokines 


(13,14) 


FSH 


(69) 


LH/CG 


(15-17,48) 



TyrlSl 



Ami imi - 



282 






a Due to space limitations, we were only able to provide a limited 
number of references for the models listed and could not include all 
models for GPCRs that have been constructed. 



Fig. 2. Model of TRH receptor. A side view of the three-dimensional 
model of the TRH receptor with the extracellular space at the top and 
the intracellular space at the bottom. The model includes the extra- 
cellular loops and the transmembrane helices. The residues shown 
are those involved in TRH binding. 



worthy that a disulfide bond links ECL1 to ECL2 (see Fig. 1). 
A typical example of a model of a peptide ligand/receptor 
complex is that of cholecystokinin-8, an octapeptide that is 
the most biologically active form of this peptide, with its 
receptor (6, 7). The interactions of cholecystokinin-8 with its 
receptor are predicted to involve residues in the amino ter- 
minus, ECL2, ECL3, and the extracellular aspect of TMH6. 
The nonapeptide arginine-vasopressin is predicted to bind to 
its cognate receptor by interacting with residues in ECL1 and 
TMH2, 3, 4, 6, and 7 (8, 9) and the decapeptide GnRH is 
predicted to bind to residues in all TMHs and ECL2 and 3 in 
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Fig. 3. A close-up of the transmembrane binding pocket of the TRH receptor showing TRH (yellow) in its bound conformation. TRH interacts 
with Tyr-106 in TMH3, Tyi-282 and Trp-279 in TMH6 and Arg-306 in TMH7. 



its cognate receptor (10, 11). A model for the binding of the 
nonapeptide bradykinin predicts that bradykinin binds to res- 
idues in TMH5, 6, and 7 near the TMH/ECL boundary, 
whereas a peptide antagonist binds to residues in TMH5, 6, and 
7 that are deeper within the TMH bundle (12). Different sites for 
binding peptide agonists, peptide antagonists and nonpeptide 
antagonists have been reported in a number of GPCRs. 

Large polypeptide I protein ligands 

As noted above, larger peptides and proteins bind to the 
amino terminus, ECLs and, most likely, the TMHs of their 
specific receptors. Chemokines are 70 to 100 amino acid 
polypeptides that bind primarily to the amino termini and 
ECL1 and 2 of their cognate receptors (13, 14). Models of 
receptors for glycoprotein hormones do not include the 
amino termini (15, 16), but it has been predicted that these 
ligands bind primarily to the amino termini and ECLs of their 
receptors (17). 

Use of Models to Generate Hypotheses of GPCR 
Structure and Function 

Small ligand binding pockets 

We have constructed a model of the TRH receptor on a 
generic template for GPCRs (5). The generic template was 



developed based on an analysis of GPCR sequences and the 
constructed model was optimized using energy minimiza- 
tion and molecular dynamic simulations. We tested the mod- 
eling approach by constructing a model of Rh that we 
showed agreed well with a density projection map of the 
protein and with structural inferences derived from bio- 
chemical and mutational experiments. Highly conserved res- 
idues in all helices (Asn-43 in TM-1, Asp-71 in TM-2, Ser-112 
in TM-3, Trp-150 in TM-4, Pro-203 in TM-5, Trp-279 in TM-6 
and Asn-316 in TM-7) (Fig. 1) were used to guide the initial 
construction. In our initial model of the TRH /TRH receptor 
complex, we docked TRH into the TMH bundle of TRH 
receptor and, using a novel simulation approach, optimized 
the complex by extensive sampling of the configurationai 
and conformational degrees of freedom (18). This model 
predicted that, in addition to the experimentally determined 
interaction of pyroGlu with Tyr-106 and Asn-110 in TMH3, 
Tyr-282 in TMH6, and Arg-306 in TMH7 could interact with 
the His and ProNH 2 of TRH, respectively. These predictions 
were tested using synthetic analogs of TRH and mutational 
analysis of TRH receptor, and the experimental observations 
were shown to be consistent with the predictions (19-21). 
The final model shows that the py roGlu moiety of TRH forms 
H-bonds with Tyr-106 and Asn-110 in TMH3, His interacts 



G PROTEIN-COUPLED RECEPTOR FUNCTION 



Endo • 2001 
Vol. 142 • No. 1 



hydrophobically with Tyr-282 in TMH6, and the ProNH 2 
forms H-bonds with Arg-306 in TMH7. Thus, these studies 
allowed us to describe the molecular interactions of TRH 
with the binding pocket within the TMH bundle of TRH 
receptor. 

Ter Laak, Timmerman, Leurs, and colleagues (22, 23) con- 
structed a model of an agonist/histamine H x receptor com- 
plex based on a Br TMH template using experimental evi- 
dence for the interactions between histamine agonists and 
specific receptor residues. The model predicted an Asp in 
TMH3, a Trp in TMH4, a Lys in TMH5 and two Phe residues 
in TMH6 with which ligands would interact. Experiments 
involving functional studies with mutated receptors and dif- 
ferent antagonists provided evidence that the predictions 
were correct. These studies allowed delineation of agonist 
and antagonist binding pockets in histamine HI receptor. 

Befort and colleagues (24) constructed a model of the TMH 
bundle of the 6-opioid receptor based on a Br template and 
sequence alignment with other GPCRs. The model predicted 
that a series of aromatic residues in TMHs project into the 
bundle. These residues were mutated and binding of opioid 
analogs was measured. Evidence was obtained that a Tyr in 
TMH3, a Trp in TMH4, a Phe in TMHS, a Trp in TMH6, and 
a Tyr in TMH7 provide interactions with the ligands. These 
findings further demonstrated the importance of aromatic 
residues in binding small ligands. 

Almaula and co-workers (25) constructed a model of the 
serotonin type 2A receptor that predicted that the ligand 
would interact with a Ser one helical turn below the Asp in 
TMH3 that binds the cationic primary amino group of se- 
rotonin. This prediction was shown to be consistent with data 
from experiments using Ser to Ala and to Cys mutations and 
serotonin analogs. 

Sequential binding of small ligands to GPCRs 

Based on our model of the unoccupied TRH receptor (26, 
27) (Fig. 4), we proposed that TRH binds to TRH receptor in 
at least three discrete steps. TRH interacts with: 1) Tyr-181 on 
the surface of the ECLs; 2) with Asn-289 and Ser-290 at the 
ECL/TMH boundary; and 3) with TMH residues (see above). 
Specifically, it was predicted that the pyroGlu of TRH in- 
teracts with Tyr-181 then with Asn-289/Ser-290 and then 
with Tyr-106/Asn-110. These predictions were tested exper- 
imentally. Using a mutation of Tyr-181 to Phe, we showed 
that TRH interacted with Tyr-181 via the pyroGlu moiety 
because a mutant receptor in which Phe was substituted for 
Tyr-181 (Y181F) was not able to discriminate between TRH 
analogs changed in the pyroGlu position but was able to 
distinguish between TRH and analogs substituted in the His 
or ProNH 2 positions. Similar findings were obtained with 
Y106F and N110A mutant TRH receptors, and with N289A 
and S290A mutant receptors. Most importantly, N289A ex- 
hibited a decreased binding affinity for TRH that was ac- 
counted for by a decreased rate of association rather than 
N110A that exhibited a similar decrease in affinity but in 
which the decreased affinity was caused by an increased rate 
of dissociation (27). Thus, we concluded that TRH is attracted 
to TRH receptor via interactions with Tyr-181, is conducted 
down a funnel formed by the ECLs through which it enters 




Fig. 4. A depiction of the funnel formed by the extracellular loops 
that guides TRH into the transmembrane binding pocket. The surface 
has been partially removed to show the funnel, with Tyr-181 at its 
entrance, as well as the transmembrane (TM) binding pocket. This 
figure is based on recent simulation conducted by Dr. Rosenhouse- 
Dantsker. 

the TMH binding pocket in which it exerts its effect to change 
the conformation of TRH receptor that comprises the conver- 
sion of the receptor from an inactive to active state (see below). 

Using molecular modeling and experiments with mutant 
receptors and purinergic analogs, Moro and colleagues (28) 
drew similar conclusions regarding sequential binding of 
purinergic ligands to the P2Y 1 receptor. 

Thus, it appears that small ligands enter their binding 
pockets within the TMH bundle after attraction and sequen- 
tial interactions with ECL residues. 

Peptide ligand binding pockets 

Several groups of investigators have used initial models of 
ligand /GPCR complexes to make predictions regarding 
other interactions between receptor residues and ligands to 
define the binding pockets of receptors for peptides more 
completely. The binding pockets for nonpeptide ligands in 
these same receptors were analyzed also. In a series of reports 
(6, 7, 29), Fourmy and colleagues described experiments in 
which they used a series of cholecystokinin (CCK) analogs 
and cholecystokinin-A receptor (CCK-AR) mutants coupled 
with computer simulations to develop a model of CCK bind- 
ing and CCK-AR activation. An initial model of CCK-AR was 
constructed based on the TMH template of Br, and Rh and 
/32-adrenergic receptor for sequence alignment. Based on 
experimental findings that two residues in the amino termi- 
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nus of CCK-AR bind the amino terminus of CCK-9 and 
docking the remainder of the CCK-9 peptide into the TMH 
bundle of CCK-AR, a model of the CCK/CCK-AR complex 
was constructed. (The 3D structure of CCK-9 was based on 
biophysical measurements.) The model was optimized by 
energy minimization and molecular dynamics simulation 
protocols. This model predicted that a Met residue in ECL2 
of CCK-AR interacted with the sulfated Tyr in position-3 of 
CCK-9. Using site-specific mutagenesis of this Met and 
CCK-9 analogs, these authors tested this hypothesis and 
presented evidence that supported an interaction between 
this Met and Tyr(S0 3 H)-3 as being important for ligand 
binding and receptor activation. In a subsequent study, pre- 
dictions of the model that the carboxylate of Asp-8 and the 
carboxyl terminal amide of CCK-9 interacted with Asn and 
Arg residues at the boundary of TMH6 and ECL3 were 
tested, and data supporting these interactions were obtained. 
From these findings, the authors were able to identify a 
hydrophilic group of receptor residues that interact with 
CCK-9 and hypothesized that electrostatic interactions were 
important in converting CCK-AR from an inactive to an 
active state. Gouldson and co-workers (30) constructed a 
model of CCK-AR and presented evidence based on muta- 
tional analyses that a nonpeptide agonist and antagonist 
bind via interactions with different residues within the TMH 
bundle. The antagonist was predicted to bind near the ex- 
tracellular surface of the TMH bundle via a limited number 
of contacts whereas the nonpeptide agonist, like CCK-8, 
bound deeper within the bundle and made many more con- 
tacts with receptor residues. 

Jarnagin and colleagues (12) constructed a model of the 
TMH bundle of the B2 bradykinin (BK) receptor based on Br 
and mutagenesis experiments of several GPCRs. The con- 
formation of BK in solution resolved by nuclear magnetic 
resonance spectroscopy was docked into the receptor model. 
Based on predictions of the model, a number of residues in 
the TMH5, 6, and 7 were mutated and data consistent with 
interactions between these residues and BK were provided. 

Feighner and colleagues (31) constructed an energy- 
minimized model for the hexapeptide GH-releasing peptide 
(GHRP-6) receptor. The model predicted interactions of the 
ligand with residues in TMH2, 3, 5, and 6 and in ECL1 that 
were tested by site-specific mutations. The data obtained 
were consistent with the predictions. 

Macdonald and colleagues (32) constructed a melanin- 
concentrating hormone (MCH) /MCH receptor complex by 
docking MCH in a conformation obtained by molecular dy- 
namics simulation into a receptor model based on a Br tem- 
plate. The model predicted an interaction between an Asp in 
TMH3 and Arg-11 in the 19-amino acid, cyclic peptide. The 
prediction was tested experimentally by mutation of 
the receptor and synthesis of position-11 MCH analogs. The 
findings were consistent with a role for the Asp in TMH3 in 
binding MCH and in receptor activation. 

Mouillac, Barberis, Hibert, and co-workers (8, 9, 33, 34) 
have used models of the vasopressin V la receptor to study 
binding of agonists and antagonists. Their model predicted 
that a primary binding pocket for the cyclic, nonapeptide 
vasopressin was within the TMH bundle (see above), and 
this was confirmed by mutational analysis of the receptor. 



They next studied the binding of linear peptide and non- 
peptide antagonists. They found that aromatic residues in 
TMH6 of the receptor were needed for binding antagonists 
even though they are not involved in binding agonists. These 
findings provide new insights that could be used in rational 
design of antagonists vs. agonists. 

Role of highly conserved Asn in TMHl t Asp in TMH2 and 
Asn in TMH7 in GPCR structure 

Sealfon, Weinstein, and colleagues (35, 36) used models of 
the GnRH receptor to predict structural features that were 
important for receptor function. They noticed that the highly 
conserved pattern of Asn in TMH1, Asp in TMH2 and Asn 
in TMH7 was changed in GnRH receptor to Asn in TMH1, 
Asn in TMH2 and Asp in TMH7. A model of the TMH bundle 
of GnRH receptor was constructed and energy minimized. 
The model predicted that these three residues interact to 
stabilize the helical bundle. The findings from experiments 
using site-specific mutations of these residues and functional 
analyses of the mutant receptors were consistent with this 
prediction. 

Our model of the unoccupied TRH receptor (see above) 
predicted that Asn-43, Asp-71and Asn-316 interacted via 
H-bonds to hold TMH1, 2 and 7 together (37). This prediction 
was tested by site-specific mutagenesis, and the data ob- 
tained were consistent with this hypothesis. 

In the 3D crystal structure of Rh (1), the conserved Asn in 
TMH1 and the Asp in TMH2 interact with the peptide back- 
bone carbonyl one helical turn above the conserved Asn in 
TMH7, and Asp in TMH2 and Asn in TMH7 interact via a 
water molecule also. Thus, models provided the first insights 
into how residues in TMH1, 2, and 7 through intramolecular 
interactions constrained the conformation of TMH bundle of 
GPCRs. 

Role of highly conserved Asn-Pro /Asp-Pro motif in the 
structure of TMH7 

Weinstein and colleagues (38) used results from analysis 
of a structure database search and dynamic simulations to 
construct a model of TMH7 of a GPCR. Their model pre- 
dicted that TMH7 is not an ideal a-helix but contains a kink 
caused by the Asn-Pro /Asp-Pro motif. Using the serotonin 
receptor type 2a as a prototype, they suggested that a model 
incorporating this kinked helix explained previously gener- 
ated experimental data. However, the Rh structure shows 
that the Asn-Pro-Xaa-Xaa-Tyr sequence has a regular helical 
structure, and a kink occurs at a point further along the helix. 
Other parts of TMH7, in particular the area around the Lys 
that covalently attaches to retinal and the sequence near the 
Ala residue whose carbonyl interacts with Asn in TMH1, are 
distorted. 

Role of highly conserved Asp / Glu-Arg-Tyr sequence in 
GPCR activation 

The amino acid sequence Asp /Glu-Arg-Tyr (D/E-R-Y) at 
the intracellular end of TMH3 has been shown to be impor- 
tant in receptor activation in several GPCRs. It was initially 
shown that the charged pair of Glu-Arg was needed for 
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rhodopsin activation because double mutants of these resi- 
dues failed to activate transducin (39). Scheer, Fanelli, and 
colleagues (40, 41) constructed models of native a 1B -adren- 
ergic receptors and of receptors with mutations of the Asp 
and Arg residues of the d-R-Y sequence that exhibited con- 
stitutive signaling activities. Functional experiments with 
these receptors showed that mutation of Arg to several other 
amino acids caused inhibition of stimulated signaling and to 
Lys increased basal signaling, whereas mutation of Asp to 
Ala caused a marked increase in basal activity. Their models 
predicted that the Arg residue was located in a hydrophilic 
pocket formed by the highly conserved Asn in TMH1, Asp 
in TMH2 and Asn and Tyr in TMH7 (see above) in the 
receptors that were not basally active and that this Arg was 
not in this pocket in basally active a 1B -adrenergic receptor 
mutants. Based on their experimental observations and 
model predictions, and the finding that activation of Rh is 
associated with proton uptake (42), these investigators pro- 
posed the following hypothesis for activation of the a 1B - 
adrenergic receptor. The inactive receptor is restrained by 
interactions between Arg of d-R-Y and the hydrophilic res- 
idues of the pocket formed between TMH1, 2 and 7. Activation 
is secondary to protonation of the Asp in d-R-Y that causes the 
Arg of the d-R-Y sequence to move out of the TMH bundle and 
changes the orientation of residues in ICL2 and 3 that allow for 
increased affinity of coupling to G protein-coupled. 

Fanelli and colleagues (43) analyzed the Asp-Arg-Cys, 
rather than Asp-Arg-Tyr in most GPCRs, sequence at the end 
of TMH3 in the oxytocin receptor by mutation and modeling. 
Mutation of the Arg to Ala caused this receptor to become 
constitutively active. Nevertheless, they predicted that the 
same movement of Arg from a hydrophilic pocket formed by 
TMH1, 2, and 7 in the oxytocin receptor creates a crevice 
formed by ICL2 and 3 and the intracellular extension of 
TMH5 to which a G protein may couple. This hypothesis is 
similar to the one they proposed for the a 1B -adrenergic re- 
ceptor (see above). 

A different hypothesis regarding a protonation-dependent 
mechanism of activation involving Arg at the intracellular 
aspect of TMH3 has been proposed for the GnRH receptor. 
From their models, Ballesteros and colleagues (36) predicted 
that the Arg of the Asp-Arg-Ser sequence at the end of TMH3 
interacts with the adjacent Asp in the inactive state of the 
GnRH receptor. Activation involves release of the Arg from 
interacting with Asp by Asp protonation and promotes 
movement of the Arg into a hydrophilic pocket in the TMH 
bundle. An He one helical turn above the Arg in TMH3 sterically 
directs the Arg into the TMH hydrophilic pocket. They pro- 
vided experimental support for this hypothesis by finding that 
mutation of Arg markedly decreased agonist-stimulated acti- 
vation whereas receptors with mutations of Asp to Asn or He 
to Ala were less well stimulated than GnRH receptor. 

Thus, these models predict that movement of the sidechain 
of the Arg of the highly conserved E/d-R-Y sequence is in- 
volved in receptor activation. In particular, they support a role 
for the Arg as a switch to convert a GPCR from an inactive to 
an active state in response to agonist-stimulated receptor pro- 
tonation. A recent report, however, has provided evidence that 
the conserved Asp in the d-R-Y sequence is not necessary for 



protonation-dependent activation of the /^-adrenergic receptor 
even though protonation may be involved (44). 

Role of transmembrane helix movement in GPCR activation 

Experiments with mutant rhodopsins in which double 
site-specific substitutions by Cys labeled for electron spin 
resonance spectroscopy demonstrated that movement of 
TMHs relative to one another were involved in receptor 
activation (45). In the case of Rh, TMH3 and 6 were shown 
to exhibit relative motion upon light activation. Movement 
of TMHs of Rh upon conversion of 11-cis-retinal to all trans- 
retinal is predicted from the crystal structure also (1). Gether 
and co-workers (46) performed experiments using a confor- 
mationally sensitive Cys-reactive reagent and found that 
positions in TMH3 and 6 changed their environments upon 
binding /^-adrenergic agonists. Based on their model of the 
0 2 -adrenergic receptor, they concluded that these move- 
ments were similar to those proposed in Rh. 

Based on our analyses of unoccupied and TRH-occupied 
TRH receptor models, we (47) predicted relative movement 
of TMH5 and 6 upon activation of TRH receptor. To provide 
evidence in support of this hypothesis, we mutated residues 
in TMHS and 6 that our model predicted were involved in 
interhelical aromatic interactions that constrain the receptor 
in the inactive conformation. Mutation of these residues to 
amino acids that could not form aromatic interactions should 
allow these helices to move apart and thereby activate TRH 
receptor. As predicted, mutation of Phe-199 in TMHS and of 
Trp-279 in TMH6 to Ala produced receptors that were con- 
stitutively active. 

Lin and colleagues (15) constructed a model of the LH/CG 
receptor and used it along with an analysis of naturally 
occurring constitutively activating mutations to predict 
TMH movements that constitute LH/CG receptor activation. 
They concluded that hydrophobic interactions among resi- 
dues in the intracellular halves of TMHS and 6, and polar 
interactions between residues in TMH6 and 7 constrain 
LH/CG receptor in an inactive conformation. They sug- 
gested that activation involves agonist-induced disruption of 
these interactions that allows TMH6 movement. Findings 
inconsistent with some of these predictions, however, have 
been reported (16). An analysis of a number of constitutively 
active LH/CG receptor mutants supports the idea that ac- 
tivation is caused by release of intrahelical interactions (48). 

Fanelli and colleagues (43) predicted movements of 
TMH3, 4, 5, and 6 as components of activation of the oxytocin 
receptor secondary to movement of the Arg sidechain of the 
E/d-R-Y sequence (see above). Zhang and Weinstein (49) 
predicted that TMHS and 6 underwent the greatest motion 
when agonists bound to serotonin type 2 receptors. 

Biologically active conformation of an agonist 

Prevailing theory of GPCR activation holds that the re- 
ceptor can assume many conformations within a spectrum 
from inactive to fully active and that different agonists will 
lead to different subsets of receptor conformations. In gen- 
eral, especially for small ligands, conformational changes of 
the agonist are not considered. However, to understand re- 
ceptor activation, the conformation of the active pharma- 
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cophore is critical. We (50) found that the conformation of 
TRH in our model of the TRH/TRH receptor complex was 
different from the predominant conformations of TRH in 
solution. To gain insight into the biologically active TRH 
conformation, Dr. Kevin Moeller (Washington University, St. 
Louis, MO) synthesized conformationally restricted analogs 
of TRH. We characterized two diastereomers of an analog 
restricted by addition of a methylene bridge between the 
second and third positions of a TRH analog, pyroGlu<yclo- 
hexylAla-ProNH 2 . We found that one diastereomer bound 
with higher affinity and exhibited higher intrinsic activity 
than pyroGlu-cyclohexylAla-ProNH 2/ whereas the other di- 
astereomer bound with lower affinity and was less active 
than pyroGlu-cyclohexylAla-ProNH 2 . Computer simula- 
tions of these analogs predicted that the higher affinity, more 
active diastereomer assumed a conformation that was sim- 
ilar to TRH in our model of the TRH/TRH receptor complex 
whereas the lower affinity, less active diastereomer was sim- 
ilar in structure to TRH in solution. Thus, our model of the 
TRH/TRH receptor complex predicted a biologically active 
conformation of TRH. These types of predictions may allow 
for the design of better agonists of GPCRs. 

Conclusions 

The emergence of the relationship between structure and 
function in GPCRs has been made possible by the combina- 
tion of computational modeling and molecular pharmaco- 
logical experiments. We are only at the beginning of this 
process, but we have already made progress in understand- 
ing the basis for receptor selectivity, i.e. how ligands bind to 
receptors. We are beginning to delineate the molecular de- 
tails of the consequences of agonist interaction with recep- 
tors, i.e. the molecular basis for receptor activation. We are 
also gaining increasing understanding of how mutated re- 
ceptors become constitutively active and lead to human dis- 
ease. One of the most exciting recent developments is the 
determination of the structure of rhodopsin. Hopefully, 
structures of other GPCRs will follow. However, while the 
structure of rhodopsin may provide a guideline to the struc- 
tures of related receptors, it does not delineate the details of 
the structures of other receptors and, more importantly, it 
does not elucidate the structural changes that lead to receptor 
activation. Thus, a combination of theory and experiment 
will remain the approach of choice not only to discover the 
basis for receptor function but also to successfully design 
therapeutic agents to treat human disease. 
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Nearly 2000 G protein-coupled receptors (GPCRs) 1 have been 
reported since bovine opsin was cloned in 1983 (1) and the ^adre- 
nergic receptor in 1986 (2). They are classified into over 100 Bub- 
families according to the sequence homology, ligand structure, and 
receptor function. A substantial degree of amino acid homology is 
found among members of a particular subfamily, but comparisons 
between subfamilies show significantly less or no similarity. 

Mutations have been observed that relate to a wide spectrum of 
hereditary and somatic disorders and diseases from cancer to in- 
fertility. These mutant receptors are incapable of binding ligand or 
generating normal signals, constitutively generate signals, or are 
not appropriately expressed on the cell surface. On the other hand, 
some mutations are beneficial. For example, a mutation in a che- 
mokine (CCR5) receptor, which is a co-receptor for human immu- 
nodeficiency virus (HIV), prevents binding of HIV to target cells 
and consequently prevents HIV viral infection among the majority 
of homozygotes with this mutation (3). 

Although the majority of GPCRs mediate signal transduction via 
G proteins, emerging evidence indicates that some of these recep- 
tors are also capable of sending signals via alternative signal mol- 
ecules, e.g. Jak2 kinase, phospholipase Cy, or protein kinase C. 
These alternative pathways are an indication of the overall diver- 
sity occurring in the GPCR superfamily. Furthermore, there are 
putative seven transmembrane molecules, which do not appear to 
be coupled to a G protein. 

The most striking difference has been observed in the sites and 
modes of ligand binding and signal generation, which not only 
manifests the diversity but also indicates the availability of numer- 
ous alternative approaches to clinical and industrial applications. 
In this review, we describe the general structure and ligand inter- 
actions of the receptors, and in the following review, Kobilka's 
group (4) focuses on the conformational changes during receptor 
activation. 

General Structure: N-terminal Segment, Seven TMs, Three 
Exoloops, Three-Four Cytoloops, and C-terminal Segment 
As Bhown in Fig. 1A, all GPCRs have an extracellular N-terminal 
segment, seven TMs, which form the TM core, three exoloops, three 
cytoloops, and a C-terminal segment. A fourth cytoplasmic loop is 
formed when the C-terminal segment is palmitoylated at Cys. Each 
of the seven TMs is generally composed of 20-27 amino acids. On 
the other hand, N-terminal segments (7-595 amino acids), loops 
(5-230 amino acids), and C-terminal segments (12-359 amino ac- 
ids) vary in size, an indication of their diverse structures and 
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functions. Interestingly, there is a weak positive correlation be- 
tween an N-terminal segment's length and ligand size (5), suggest- 
ing a role in ligand binding, in particular for large polypeptides and 
glycoprotein hormones. A notable exception is the -600-amino acid 
N-terminal segment of neurotransmitter receptors such as the 
calcium receptor. 

Why Seven TMs?— The ubiquitous adoption of a seven TM struc- 
ture raises the inevitable question concerning its structural and 
functional merits. Odd numbers of TMs place the N- and C-termi- 
nal segments at opposite membrane surfaces. It allows glycosyla- 
te and ligand binding at the N-terminal segment, and phospho- 
rylation and palmitoylation at the C-terminal segment for 
desensitization (6) and internalization. Seven TMs may be the 
minimum necessary to form six loops and a TM core with a suffi- 
cient size and versatility to offer a prodigious number of specifici- 
ties, regulatory mechanisms, and contact sites for G protein and 
other signal molecules such as Jak2 kinase (7), phospholipase Cy 
(8), GPCR kinases (6), arrestin (6), calmodulin (9), and/or protein 
kinase C (9). In contrast, five TMs may be insufficient to form a 
stable yet flexible TM core, whereas nine would be more than 
enough. For example, ion-gated channel proteins are pentamers of 
four TM subunits, which form a central hydrophilic channel (5) or 
a 20-TM monomer (10). In either case, they comprise 20 TMs, and 
the channel is narrow, excluding ions >5 A (10). Apparently, four 
TMs by themselves are not sufficient to form a functional core. 

Kinked and Tilted TM a-Helices of Unequal Length and Hydro- 
phobicity — TMs are likely to assume diverse structures. The TMs of 
bacteriorhodopsin (11) and animal rhodopsin (12) form a-helices al- 
though some TM of other membrane proteins such as porins have 0 
structures (13). TM a-helices vary in length and can extend beyond 
the lipid bilayer. Therefore, the boundaries between TMs and loops 
are likely to be uneven and may be dynamic. TMs 1, 4, and 7 are 
significantly more hydrophobic than TMs 2, 3, 5, and 6 that contain 
several ionic and/or neutral residues. In fact, some of the more hy- 
drophobic TMs 1, 4, and 7 have only one hydrophilic residue such as 
Asn or Ser. Pro residues are frequently found in TM a-helices, unlike 
a-helices in globular proteins. A Pro can kink the helix backbone by 
-26° and impact the global structure. Some TM a-helices are tilted in 
the membrane. For example, TM 3 of animal rhodopsin is most tilted 
by -30°, whereas TMs 4, 6, and 7 are the least tilted (12). TMs 3, 5, 
and 6 are suggested to protrude more than the others from the lipid 
bilayer toward the extracellular surface. TMs often contain Cys res- 
idues, and some are believed to be apposed although there is no 
convincing evidence for a TM disulfide linkage. 

Counterclockwise Orientation and Closed Loop Formation of 
Seven TMs — The seven TMs of bacteriorhodopsin (11), animal rho- 
dopsin (12), and adrenergic receptors (14) are arranged as a closed 
loop in the counterclockwise direction from TM 1 to TM 7 when 
viewed from the extracellular surface (Fig. 1A). The orientation of 
the TMs imposes a stereo- and geometric specificity on a ligand's 
entry into and binding to the TM core. In this arrangement, the 
core is primarily comprised of TMs 2, 3, 5, 6, and perhaps 7, 
whereas TMs 1 and 4 are peripherally sequestered. In general, 
TMs 1, 2, and 7 are apposed (15). This arrangement is consistent 
with the view that the more hydrophobic TMs 1, 4, and 7 are 
exposed more to the lipid bilayer than the less hydrophobic TMs 2, 
3, 5, and 6. It is unknown whether there are GPCRs that are 
open-looped or not closed between TMs 1 and 7. Different arrange- 
ments can provide interesting mechanisms for ligand binding and 
receptor activation. For example, if TM 7 associates more closely 
with TM 2 than TM 1, there wilj be a long crevice between TMs 1 
and 7. Such a crevice offers a binding site for the fatty chain of 
lysophosphatidic arid. Also, TMs 1 and 7 of a GPCR may associate 
with TMs 1 and 7 of another GPCR to form a dimer. 

TMs are entropically driven into the lipid bilayer primarily by 
water molecules present outside of the membrane. However, it is 
unclear what forces are primarily responsible for arranging TMs 
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Fig. 1. Schematic presentation of the general structure of GPCRs and receptor-llgand interactions. A, general structure and terminology. 
Several distinct modes have been observed for ligand binding and rignal generation at exclusively the TM core for photon, biogenic amines, nucleosides, 
eicosanoids, and moieties (lysophosphatidic acid and sphingosine l-onosphate) of lipids (fl), and the core, exoloops, and N-tenninal segment for peptides of 
s40 amino acids (C). D, protease Uganda like thrombin bind to and cleave the N-terminal segment The resulting shorter N- terminal segment interacts with 
exoloops to generate a signal, whereas the released peptide binds to platelet to stimulate platelet aggregation. E, glycoprotein hormones, LH, FSH, human 
CG, and TSH, bind to the ~350-amino acid N-terminal segment and the liganded N-terminal segment interacts with exoloops to generate a signal. F, small 
neurotransmitters, Ca a+ , glutamate, and GABA, bind to the ~600-amino acid N-tenninal segment, and the liganded N-terminal segment interacts with the 
membrane-associated domain, thus generating a signal. 



within the lipid bilayer: for example, whether the hydrophilic sides 
of TMb are entropically driven away from the lipid bilayer, the 
hydrophobic sides of TMs prefer to associate with the lipid bilayer, 
or groups in the TM core bond together, e.g. by van der Waals 
interactions. 

TM Core Tightly Packed by Hydrogen Bonds and Salt Bridges— 
The recent high resolution three-dimensional structure of bacteri- 
orhodopsin (11) suggests that the TM core does not have a vacant 
pocket, channel, or tunnel structure, as generally thought. Al- 
though the TM core has polar residues and, therefore, is considered 
hydrophilic, there are only a limited number (-10) of associated 
water molecules (11). Extensive hydrogen bonds exist between 
residues of the same TM as well as other TMs like TMs 2 and 7 (11, 
15). In addition to hydrogen bonds, conserved TM 2 Asp pairs with 
TM 1 and TM 7 Asn residues in the TSH-releasing hormone recep- 
tor, and conversely, TM 2 Asn with TM 7 Asp in the GnRH receptor 
(15). Although TMs have Cys residues and some could be paired, no 
specific pairs have been identified. Because of these interactions, 
there may not be space for a TM pore in most GPCRs. The core of 
bacteriorhodopsin is so tightly packed in the basal state that trans- 
fer of a proton or water molecule is unlikely (11). This is reminis- 
cent of the observation that the 20-TM ion channels are not large 
enough for ions >5 A to pass (10). 

Disulfide Bonds — Disulfides always play a major role in the 
structure and stability of proteins. Among numerous Cys residues 
of eukaryotic GPCRs, the two conserved Cys residues in exoloops 1 
and 2 are known to be linked by a disulfide in bovine rhodopsin, the 
TSH-releasing hormone receptor, the thromboxane receptor, and 
the GnRH receptor. This disulfide is expected to constrain the loops 
and receptor, specifically pulling exoloop 2 over and thus blocking 
the extracellular opening of the core. Surprisingly, such a structure 
is not seen in bacteriorhodopsin, which has no Cys. Instead, exoloop 
2 of bacteriorhodopsin is projected away from the core opening and 
is not near exoloop 1. Furthermore, the two corresponding Cys 
residues in some GPCRs have been suggested to pair with another 
Cys rather than with each other (16). Disulfides exist in the N- 
terminal segment of the glycoprotein hormone receptors (17). 



Ligand Binding and Receptor Activation 
Several distinct modes (Fig. 1, B-F) have been observed for high 
affinity ligand binding to the TM core exclusively (photon, biogenic 
amines, nucleosides, eicosanoids, and moieties (lysophosphatidic 
acid and sphingosine 1-phosphate) of lipids), to both the core and 
exoloops (peptides £40 amino acids), to exoloops and N-terminal 
segment (polypeptides ^90 amino acids), or exclusively to the N- 
terminal segment (glycoproteins =»30 kDa). The distinction be- 
tween ligand binding and receptor activation is supported by the 
existence of antagonists that competitively inhibit agonist binding. 
Further, glycoprotein hormones bind to the N-tenninal segment 
whereas receptors are activated at the membrane-associated do- 
main (5). Interactions of Uganda and receptors seem to involve 
Hydrogen bonds, ion pairs, and hydrophobic contacts. 

Ligand Binding and Signal Generation in the TM Core 
To define better the mechanics of receptor activation it may be 
divided into at least three steps, signal generation, TM signal 
transduction, and signal transfer to cytoplasmic signal molecules 
(5). The simplest way to activate receptors is for a ligand to bind, 
generate a signal in, and transduce it through the TM core (Fig. 
IB). In such a case, changes in TM hydrogen bonds are likely to be 
the underlying mechanism for signal generation and TM reorgani- 
zation. For example, photoactivation of retinal straightens its bent 
configuration and rearranges hydrogen bonds and the packing of 
TMs 3 and 5-7. In biogenic amine receptors, TM 3 is believed to be 
the primary site for ligand binding whereas TMs 5 and 6 are the 
signal generation site. Although initial ligand binding and signal 
generation are related yet separate phenomena, it is technically 
difficult to distinguish them. However, a few receptor systems such 
as glycoprotein hormone receptors and protease-activated recep- 
tors are available as good models to distinguish experimentally 
ligand binding from signal generation and thus investigate the 
transition between these two steps. 

Rhodopsin — The family of photoreceptors comprises several iso- 
forms specific for red, blue, and green colors. These color charac- 
teristics stem from variations in their amino acid sequences. All of 
them have a retinal chromophore, which is attached through for- 
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mation of a Setoff s base between the aldehyde moiety of retinal 
and the c-amine of the Lys in the middle of TM 7 (18). The proto- 
nated Schiff s base is paired with conserved Glu 113 present at the 
boundary between TM 3 and exoloop 1, thus bringing TMs 3 and 7 
in apposition. Photoaffinity labeling, mutational analysis, and 
modeling show that the j3-ionone ring of retinal associates with 
TMs 3, 5, and 6, particularly TM 6 Trp 265 and Tyr 268 (19). The C 9 
methyl group of 11-cis-retinal appears to associate with TM 3 
Gly 125 (19). Light absorption causes a bent 11-cis to linear a\\-trans 
isomerization and part of the TM peptide backbone to be exposed. 
As a result at least one water hydrogen bond is formed (20) and the 
salt bridge constraint between TM 3 Glu 113 and TM 7 Lys 296 may 
break. Because each hydrogen bond energy is worth 3-7 kcal/mol, 
these changes are likely and sufficient to rearrange TMs, particu- 
larly TMs 3, 6 (19), and 7, thereby generating a signal. 

Biogenic Amine Receptors — Biogenic amines, epinephrine, nor* 
epinephrine, dopamine, and histamine enter and bind the TM core 
(4, 18). The amine of catecholamines pairs with the carboxyl group 
of TM 3 Asp n3 of the receptor, 9 A deep in the core. TM 3 Asp 113 is 
crucial for ligand binding but not for signal generation. The fact 
that Asp 113 could not be replaced with Glu underscores the tight 
space and importance of the size and orientation of the carboxyl 
chain. The catechol ring of the ligand is thought to dock the pocket 
consisting of TMs 5 and 6. Specifically, the meta and para OH 
groups of the catechol ring appear to hydrogen bond, respectively, 
to TM 5 Ser 204 and Ser 207 , which lie one a-helical turn apart on the 
same side of the TM 5 a-helix. Because this double hydrogen 
bonding would constrain the TM 5 a-helix (18) and/or the ligand 
and the ligand-linked TM 3 a-helix, the TM structure and packing 
are expected to be adjusted. In fact, the TM 6 structure, particu- 
larly Cys 235 , changes in activated receptors (21), and Phe 290 is 
thought to stabilize the catechol ring (18). Because the interactions 
of the catechol ring are more important for receptor activation than 
ligand binding, these changes could generate a signal. In addition, 
the interaction of TM 3 Asp 125 and TM 7 Lys 331 may break upon 
receptor activation (22). For histamine, the amine may ion pair 
with the TM 3 Asp, whereas the imidazole ring associates with TM 
5 Asp and Thr, which correspond to TM 5 Ser 204 and Ser 207 
of catecholamine receptors (18). Acetylcholine also appears to 
bind similarly the TM core, particularly TMs 5 and 6 for signal 
generation (18). 

Nucleoside and Nucleotide Receptors — Extracellular adenosine, 
adenine nucleotides, and urdine nucleotides are capable of binding 
G protein-coupled metabotropic receptors (P2Y receptors). Studies 
of metabotropic adenosine receptors suggest that ligands bind pri- 
marily in the TM core and exoloops. For the adenosine A x receptor 
ligand binding requires TMs 1-4 (23), 6, and 7 as well as exoloop 2 
(24). On the other hand, TMs 5-7, particularly Ser 277 and His 278 of 
TM 7, are important for ligand binding by the adenosine A 2 recep- 
tor (25). Signal generation by adenosine receptors appears to re- 
quire additional unknown ligand contacts, because agonists and 
antagonists bind the receptors differently. Four subclasses of che- 
moattractant cAMP receptors are found in Dictyostelium. These 
receptors are coupled to a G protein that activates adenylyl cyclase. 
For ligand binding by the cAMP receptor 1, TMs 3, 5, and 6 and 
exoloop 3 are important, and exoloop 2 appears to modulate ligand 
access to the TM core binding site (26). 

Eicosanoid Receptors — Eicosanoids are derived from arachidonic 
acid and have a carboxyl group. This family of lipid hormones, 
which includes leukotrienes and prostanoids (prostaglandins, pros- 
tacyclins, and thromboxanes), binds specific subfamily receptor 
members of high homology. The ligands bind the TM core, and 
interactions with TMs 3, 6, and 7 are crucial (27, 28). A conserved 
TM 7 Arg near the extracellular surface is thought to pair with the 
carboxyl group of the ligand (27). The structure and orientation of 
the cyclopentane ring of prostanoids influence receptor specificity 
(28), suggesting variations in the TM core structure. 

Short Peptide Receptors with Ligand Binding Partially in the TM 
Core and Exoloops 
Formyl Receptor— Tripeptide JV-formyl-Met-Leu-Phe is one of 
the smallest peptide ligands for GPCRs. Spectroscopic and muta- 
tional studies (50) show that the JV-formyl moiety of the ligand 
binds in the TM core around TMs 2 and 3, whereas the C-terminal 
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region of the ligands associates with the N-terminal segment and 
exoloops 1 and 2 (29). Recently, the VaF-Arg^-Lys 85 sequence 
that joins TM2 and exoloop 1 has been shown to be photoaffinity 
labeled with the ligand (30). 

Glycoprotein Hormone-releasing Hormone Receptors — Muta- 
tional analyses of the thyrotropin-releasing hormone (pyroEHP- 
NH 2 ) receptor show that several amino acids in TM 3 and one in 
exoloop 2 are important for ligand binding. Mutation and affinity- 
labeling studies of gonadotropin-releasing hormone suggest the N 
terminus of GnRH binds the TM core and the C terminus around 
exoloops 2 and 3. His 2 of GnRH is believed to be close to TM 3 
Lys 121 , Arg 8 of GnRH to exoloop 3 Asp 302 , and the C-terminal 
glycinamide to exoloop 2 Asn 102 (15). Also, the N-terminal segment 
appears to be near the ligand (31). Because TM 3 Lys 121 is -7.6 A 
away from the extracellular surface and its side chain could extend 
to near the surface, GnRH enters, but not deeply into, the TM core. 

Angiotensin Receptor — The hydrophobic C-terminal region of an- 
giotensin II (DRVYIHPF) appears to enter the TM core of the 
receptor, and the C-terminal carboxyl group pairs with TM 5 Lys 199 
(32), 7-14 A from the extracellular surface. On the other hand, 
Asp-Arg of the DRVYIHPF sequence seems to ion pair with exoloop 
2 His 183 and exoloop 3 Asp 281 of the receptor, respectively (33). The 
interaction with Asp 281 is more crucial for signal generation. 

Multistep Contacts at the N-terminal Segment and Exoloops 
Receptors for Glucagon, Calcitonin, and Vasoactive Intestinal 
Peptides — This class of peptide hormones is 30-40 amino acids 
long. On the other hand, the N-terminal segments of their recep- 
tors are 116-147 amino acids long and primarily responsible, but 
not sufficient, for high affinity ligand binding, as exoloops also 
appear to be required (34). Partial entry of the ligand into the TM 
core is a possibility. Gly l0 -Arg a4 of calcitonin is essential for inter- 
action with the N-terminal segment of the receptor. The resulting 
transient hormone-receptor complex, including particularly the N- 
terminal 10 residues of calcitonin, may make secondary contact 
with exoloops and generate a signal (34). In the glucagon receptor, 
Asp 126 -Lys 137 of the 145-amino acid N-terminal segment and ex- 
oloop 1 are crucial for ligand binding (35). 

Parathyroid Hormone (PTH) Receptors— The C-terminal region 
of 84-amino acid PTH is important for receptor binding, whereas 
the N-terminal region is crucial for receptor activation. For the 
receptor, ligand specificity resides primarily in the N-terminal 
segment whereas signal is generated in the membrane-associated 
domain, particularly involving TM 3 He 244 and exoloop 2 Tyr 318 
(36). These results suggest the two-step interaction of PTH and the 
receptor, first to form the transient PTH-N-terminal segment com- 
plex and next for the complex to interact with the membrane- 
associated domain to generate a signal. 

Initial Ligand Binding at the N-terminal Segment and 
Subsequent Secondary Contact of the N-terminal Segment with 
Exoloops 

Protease-activated Receptors— This family of receptors for 
thrombin and other proteases is activated by proteolysis of the 
N-terminal segment (37). The protease thrombin recognizes the 
sequence L 38 DPRSFLLRNPNDKYEPF 55 in the 74-amino acid N- 
terminal segment of the thrombin receptor and cleaves it at the 
thrombin cleavage sequence, L 38 DPR I S 42 . The N-terminal seg- 
ment is sufficient for the proteolysis by thrombin, and the released 
peptide, Met J - Arg 4 \ is a strong agonist to stimulate platelet ag- 
gregation (38). At the same time, the resulting new 33-amino acid 
N-terminal segment acts as a tethered ligand and intramolecularly 
binds to exoloops of the remaining receptor; in particular Arg 6 of 
the new N-terminal segment interacts with exoloop 2 Glu 260 to 
generate the signal (39). Protease-activated receptor 2 is structur- 
ally similar to thrombin receptors and is cleaved at R 3e I S 37 in the 
N-terminal segment. The resulting new N-terminal segment con- 
tacts exoloops, most likely between the conserved Arg of the N- 
terminal segment and Glu in exoloop 2 (40). 

Glycoprotein Hormone Receptors — The glycoprotein hormone re- 
ceptors consist of two roughly equal domains, a 350-400-residue 
N-terminal segment and a —300-amino acid membrane-associated 
domain. The N-terminal segment alone is capable of high affinity 
ligand binding whereas the membrane-associated domain is the 
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site of receptor activation (5). The N-terminal segment has 8-9 
Leu-rich repeats. These Leu-rich repeats are thought to form a 
crescent with the concave inner surface consisting of 0-sheets, 
which may bind Uganda (41, 42). In addition to the Leu-rich cres- 
cent, the N-terminal region of the N-terminal segment contacts the 
hormone (43). Although the N-terminal segment alone is capable of 
high affinity ligand binding, this is modulated by exoloops of the 
receptor (44). 

Glycoprotein hormones, LH, FSH, CG, and TSH, are the largest 
(30-40 kDa) and most complex GPCR agonists. They are het- 
erodimers of a common a-subunit and a hormone-specific £-sub- 
unit. These hormones initially bind exclusively to the N-terminal 
segment with high affinity. The hormone-N-terminal segment com- 
plex undergoes conformational changes (45) and makes secondary 
interactions with the membrane-associated domain, thus generat- 
ing a signal (5). The initial high affinity interaction includes mul- 
tiple contacts between the N-terminal segment and both subunits 
of the hormone (41, 42). They involve the a C-terminal region, the 
N-oligosaccharide at aAsn 53 , and a unique a-helix in the a-subunit, 
as well as an unusual loop (seat belt) in the /3-subunit and the 
peripheral 0-hairpin loops of both subunits (41). Secondary con- 
tacts occur between the liganded N-terminal segment and exoloops 
1_3 (5, 44, 46). Signal is, therefore, likely generated at exoloops. 
In fact, some mutations in exoloops 1-3 result in constitutive 
activation (47). 

Neurotransmitter Receptors — This family of receptors is noted 
for the longest N-terminal segments (-600 amino acids) that rec- 
ognize glutamate, -y-aniinobutyric acid (GABA), and calcium/cations. 

For metabotropic glutamate receptors, the N-terminal half of the 
-~600-amino acid N-terminal segment not only provides the ligand 
binding site and specificity but also indicates the signal specificity 
for activation of phospholipase C or inhibition of adenylyl cyclase 
(48). Some metabotropic glutamate receptors bind calcium as well 
as glutamate, and Ser l6€ appears to be important (49). The struc- 
ture and size of the GABA B and calcium receptor are similar to 
those of the metabotropic glutamate receptors. Therefore, they are 
expected to similarly bind ligands and be activated. 

Conclusions and Future Directions 

An increasing number of GPCR subfamilies shows diverse modes 
of ligand binding, signal generation, TM signal transduction, and 
signal transfer to various cytoplasmic signal molecules other than 
G protein, such as Jak2 kinase (7), phospholipase Cy (8), and 
protein kinase C (9). Therefore, the question rises whether these 
distinct responses are induced by a single signal originating from a 
single receptor or by separate signals generated by a single recep- 
tor or different receptors. If a single receptor is capable of activat- 
ing one or more effectors, does it activate them simultaneously or 
only one at a time? The general underlying mechanisms appear to 
involve changes in interactions and conformation, particularly hy- 
drogen bonds, salt bridges, TM rearrangement, and perhaps recep- 
tor oligomerization (4). 
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Deciphering the Message in Protein Sequences: 
Tolerance to Amino Acid Substitutions 
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n amino acid sequence encodes a message that deter- 
ines the shape and function of a protein. This message is 
ghty degenerate in that many different sequences can 
de for proteins with essentially the same structure and 
tivity. Comparison of different sequences with similar 
essages can reveal key features of die code and improve 
iderstanding of how a protein folds and how it per- 
rms its function. 



rHE GENOME IS MANIFEST LARGELY IN THE SET OF PRO- 
tcins that it encodes. It is the ability of these proteins to fold 
into unique three-dimensional structures that allows them to 
iction and carry out the instructions of the genome. Thus, 
nprchending the rules that relate amino acid sequence to struc- 
e is fundamental to an understanding of biological processes, 
cause an amino acid sequence contains all of the information 
ressary to determine the structure of a protein (f), it should be 
ssible to predict structure from sequence, and subsequently to 
br detailed aspects of function from die structure. However, both 
)blems are extremely complex, and it seems unlikely that either 
1 be solved in an exact manner in the near future. It may be 
isible to obtain approximate solutions by using experimental data 
simplify the problem. In this article, wc describe how an analysis 
allowed amino acid substitutions in proteins can be used to 
ucc the complexity of sequences and reveal important aspects of 
icturc and function. 



ethods for Studying Tolerance to 
quence Variation 

rhcre arc two main approaches to studying the tolerance of an 
ino acid sequence to change. The first method relics on the 
cess of evolution, in which mutations arc either accepted or 
cted by natural selection. This method has been extremely 
vcrful for proteins such as the globins or cytochromes, for which 
uenccs from many different species are known (2-7). The second 
•roach uses genetic methods to introduce amino acid changes at 
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specific positions in a cloned gene and uses selections or screens to 
identify functional sequences. This approach has been used to great 
advantage for proteins that can be expressed in bacteria or yeast, 
where the appropriate genetic manipulations arc possible (3, 8-11). 
The end results of both methods arc lists of active sequences that can 
be compared and analyzed to identify sequence features that are 
essential for folding or function. If a particular property of a side 
chain, such as charge or size, is important at a given position, only 
side chains that have the required property will be allowed. Con- 
versely, if the chemical identity of the side chain is unimportant, 
then many different substitutions will be permitted. 

Studies in which these methods were used have revealed that 
proteins arc surprisingly tolerant of amino acid substitutions {2-4, 
11). For example, in studying the effects of approximately 1500 
single amino acid substitutions at 142 positions in lac repressor, 
Miller and co-workers found that about one-half of all substitutions 
were phenotypically silent {it). At some positions, many different, 
nonconscrvative substitutions were allowed. Such residue positions 
play little or no role in structure and function. At other positions, no 
substitutions or only conservative substitutions were allowed. These 
residues arc the most important for lac repressor activity. 

What roles do invariant and conserved side chains play in 
proteins? Residues that are directly involved in protein functions 
such as binding or catalysis will certainly be among the most 
conserved. For example, replacing the Asp in the catalytic triad of 
trypsin with Asn results in a 10 4 -fold reduction in activity (72), A 
similar loss of activity occurs in X repressor when a DNA binding 
residue is changed from Asn to Asp (13). To carry out their 
function, however, these catalytic residues and binding residues 
must be precisely oriented in three dimensions. Consequently, 
mutations in residues that arc required for structure formation or 
stability can also have dramatic effects on activity {10, 14-16). 
Hence, many of the residues that arc conserved in sets of related 
sequences play structural roles. 



Substitutions at Surface and Buried Positions 

In their initial comparisons of the globin sequences, Pcrutz and 
co-workers found that most buried residues require nonpolar side 
chains, whereas few features of surface side chains arc generally 
conserved (6) . Similar results have been seen for a number of protein 
families (2, 4, 5, 7, 17, 18). An example of the sequence tolerance at 
surface versus buried sites can be seen in Fig. 1, which shows the 
allowed substitutions in X repressor at residue positions that are near 
the dimcr interface but distant from the DNA binding surface of the 
protein (P). These substitutions were identified by a functional 
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Rg- 2. Amino acid substitu- 
tions allowed in the core of X 
r fP rcssor - The wild-type side 
oums are shown pictorially in 
the approximate orientation 
«ccn in the crystal structure 
(«). The lists of allowed sub- 
swutions at each position arc 
»hown below the wild-type 
side chains. These substitu- 
tions were identified by ran- 
domly mutating one to four 
residues at a time by using a 
cassette method and apply™ 
» fonctionaJ selection (20) 
Not all substitutions are al- 
lowed in every sequence back- 
ground. 
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been used to combine such information into more appropriately 

were used to align the sequences of retroviral proteases with asoarric 
JS?/"*? m ^ ^tructionof a truJXcSn- 

S»£l£ Pr0t ri 0fhuman ^unodeficiency virus tyZ 1 
(29). Companson with the recently determined crystal stnictuTof 

The structural infonnation at most surface sites is highly degencr- 

Sm^ot tnrSrf ° B f J™? 0 **" -Wues, exteSl^S 
^tote unportant chiefly in maintaining a reasonably polar 
«^cc The mformation contained in buried residues is'aLo 
E th l! Mln ^uurment being that these residues remain 
hydrophobic. Thus, at its most bask level, th e key str£5 
message in an amino acid sequence may reside in its spe^ifi^ern 
of hydrophobK and hydrophilic residues. This isWb™ 
mfonn^onal sense. Clearly, the precise structure and stSty of^ 

Ke 2^ ° n ■? ,3IBC nUmbcr ° f da ^ d interactiorZk is 
E? r ' th , at u T CtUtal P^^on at a more primitive 
evel can be accomplished by concentrating on the most bask 
mfonmnonal aspects of an amino acid sequence. For «amDk 
amphipaduc patterns can be extracted from aSgncd^ £.22qS 
and u^, .n some cases, to identify secondary srn.ctures.^ 

of^rJT "y^Phobic residues reflecting the periodicity 

SSS^SVT * W *>■ TheseVerL can £ 

obscured m md,v,dual sequences by hydrophobic residues on the 

hvdronh^f 11 h0WCVer > for a su *ce position to remaS 
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£vT^° ? dCgrCC ° f ^Ph^diic character of a sequence in a 
given secondary structure {34). The amphipathic chapter of Ae 
three a-helical regions in the Antennac4, protein 5)5 clearfc 

S^"' C Strucmre of A« pressor, a small DNA- 

SSXK? Prcdiacd * a method (*/and 

contained by nuclear magnetic resonance studies (37) 

anarrlS" and hvdrophiic residues in 

SS^S?^ must ^*e number of different structured 
tr^T T ad ° Pt *"* ™ y indeed dcfi ™ ^ overall fold. If 
StaSr^SL anangCmC ? lt of phobic and hydrophJl c 
residues should be a characteristic feature of a particular fold Sweet 

ElSS^S Sh0Wn *■ thc correlation of i^S^TS 
hydrophobic^ between two protein sequences is a eoodi Sr^n„ 
for their structural relaredn«« rvn i j jT cntcnon 

Alrhough the pattern of sequence hydrophobicity may be a 

L™,, be used for prediction of structure de novo It is 

ETo h °" P 3RC - » sequence spaTcan 

related to structures in conformation space. Lau and Dill LZ 



possible. For longer sequences only a representative f£*J? 
allowed sequence orlonformation" spaTc^d £ IE*?? 
significant results were as follows: (i) not all sequerL^S 

217" ^? rc ^ ° n,y 3 fcW ^nceSTuTqu^v: 
^ructure; (,i) the probability that a sequence will adoptTuTuc 
native structure mcreases with chain length; and (iii) rheS 
sates are compact, contain a hydrophobic surro^L'Tar 

T"" 111 S,gnificant venire- AlthouK 

gap between these two-dimensional simulations and mree-dW 
sional structures is large, the use of simple rules and^e« 
representations yields results similar to those expected fa^S 

Summary 

" morc » fonnation in a set of related sequences than in a 
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analysis of the tolerance of residue positions to change First such 
mformaoon permits the evaluation of a residue's faiS^fi 
fo^caon and stability of a protein. This ability 

s^S ST? ° f 8 ^ ^ UCnCC ™y 4«ove our und£ 
tanding of the determinants of protein foldiAg and stability as well 
as protem function. Second, patterns of tolerance «o S add 

£v ro n hTh° f T^" 6 l * to l***y «• "CP » 

hkely to be boned in a protein structure and those likely to occupy 
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Rg. 4. Helical hydro- 
phobic moments calcu- 
lated by using (A) the 
Antcnnapedia homeodo- 
main sequence or (B) a 
set of 39 aligned homco- 
domain sequences (35). 
The bars indicate the ex- 
tent of the helical re- 
gions identified in nucle- 
ar magnetic resonance 
studies of the Antcnna- 
pedia homeodomain 
To determine hy- 
drophobic moments, 
residues were assigned 
to one of three groups: 
HI (high hydrophobici- 
ty = Trp Ik,Phe,Leu, 
Met, Vai, or Cys); H2 
(medium hydrophobic- 
"y^yr^Pro^Al^Thr, 
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His Gly, or Ser); and H3 (tow hydrophobicity = &~^£*jr ^ 
or Arg). For the aligned homco^m^ ic^c^ Ly t 
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FHj. 5. A representation of one com- 
pact conformation for a particular 
sequence of H and P residues on a 
two-dimensional square lattice. 
[Adapted from (40), with permit 
sion of the American Chemical Soci- 
ety] 
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surface positions. The amphipathic patterns that emerge can be used 
to identify probable regions of secondary structure. Third, incorpo- 
rating a knowledge of allowed substitutions can improve the ability 
to detect and align distantly related proteins because the essential 
residues can be given prominence in the alignment scoring. 

As more sequences arc determined, it becomes increasingly likely 
that a proton of interest is a member of a family of related 
sequences. If this is not the case, it is now possible to use generic 
methods to generate lists of allowed amino acid substitutions 
Corisequently, at least in the short term, it may not be necessary to 
soke the folding problem for individual protein sequences. Instead 
infomuDon from sequence sets could be used. Perhaps by simplify- 
ing sequence space through the identification of key residues, and by 
simplifying conformation space as in the lattice methods, it will be 
possible to develop algorithms to generate a limited number of trial 
structures. These trial structures could then, in turn, be evaluated by 
further experiments and more sophisticated energy calculations 
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